Fuzzy Analysis in Pitch-Class Determination for Polyphonic Audio Key Finding

Authors

Ching-Hua Chuan

Elaine Chew

Abstract

This paper presents a fuzzy analysis technique for pitch class determination that improves the accuracy of key finding from audio information. Errors in audio key finding, typically incorrect assignments of closely related keys, commonly result from imprecise pitch class determination and biases introduced by the quality of the sound. Our technique is motivated by hypotheses on the sources of audio key finding errors, and uses fuzzy analysis to reduce the errors caused by noisy detection of lower pitches, and to refine the biased raw frequency data, in order to extract more correct pitch classes. We compare the proposed system to two others, an earlier one employing only peak detection from FFT results, and another providing direct key finding from MIDI. All three used the same key finding algorithm (Chew’s Spiral Array CEG algorithm) and the same 410 classical music pieces (ranging from Baroque to Contemporary). Considering only the first 15 seconds of music in each piece, the proposed fuzzy analysis technique outperforms the peak detection method by 12.18% on average, matches the performance of direct key finding from MIDI 41.73% of the time, and achieves an overall maximum correct rate of 75.25% (compared to 80.34% for MIDI key finding).
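To make "pitch class determination" concrete, the following is a minimal sketch of how spectral peaks might be folded into the twelve pitch classes. It assumes the peaks have already been picked from an FFT frame as (frequency, magnitude) pairs. The function names, the 440 Hz tuning reference, and the triangular membership weighting are all illustrative assumptions; the abstract does not specify the fuzzy analysis itself, so this should not be read as the authors' method.

```python
import math

PITCH_CLASS_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def freq_to_midi(freq_hz, a4_hz=440.0):
    """Convert a frequency in Hz to a (fractional) MIDI note number."""
    return 69.0 + 12.0 * math.log2(freq_hz / a4_hz)


def freq_to_pitch_class(freq_hz, a4_hz=440.0):
    """Crisp mapping: nearest equal-tempered semitone, folded to 0..11 (C = 0)."""
    return int(round(freq_to_midi(freq_hz, a4_hz))) % 12


def soft_pitch_class_profile(peaks, a4_hz=440.0):
    """Accumulate (frequency, magnitude) peaks into a normalized 12-bin profile.

    Each peak is weighted by a triangular membership value: 1.0 when the peak
    sits exactly on a semitone, falling linearly to 0.0 at 50 cents away.
    This membership function is a hypothetical choice for illustration only.
    """
    profile = [0.0] * 12
    for freq_hz, magnitude in peaks:
        midi = freq_to_midi(freq_hz, a4_hz)
        nearest = round(midi)
        deviation = abs(midi - nearest)      # in semitones, between 0.0 and 0.5
        membership = 1.0 - 2.0 * deviation   # 1.0 on pitch, 0.0 at a quarter tone
        profile[int(nearest) % 12] += membership * magnitude
    total = sum(profile)
    return [value / total for value in profile] if total > 0 else profile


if __name__ == "__main__":
    # Hypothetical peaks: A4, C4, and a badly mistuned peak between E4 and F4.
    example_peaks = [(440.0, 1.0), (261.63, 1.0), (339.3, 1.0)]
    for name, weight in zip(PITCH_CLASS_NAMES, soft_pitch_class_profile(example_peaks)):
        print(f"{name:2s} {weight:.3f}")
```

The crisp mapping corresponds loosely to a plain peak-detection front end, while the soft profile down-weights peaks that fall far from any equal-tempered pitch. A key-finding algorithm such as the CEG algorithm named in the abstract would then consume a profile of this kind.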


Similar resources

Audio Key Finding Using FACEG: Fuzzy Analysis with the CEG Algorithm

Our key finding system consists of a series of O(n) real-time algorithms for determining key from polyphonic audio. The system comprises two main parts, as shown in Figure 1 [1]. The first part (the upper dashed box) generates pitch class information from audio using the standard FFT and a fuzzy analysis technique. The second component (the lower dashed box) uses the pitch class information to...


Improving Automatic Music Transcription Through Key Detection

In this paper, a method for automatic transcription of polyphonic music is proposed that exploits key information. The proposed system performs key detection using a matching technique with distributions of pitch class pairs, called Zweiklang profiles. The automatic transcription system is based on probabilistic latent component analysis, supporting templates from multiple instruments, as well ...


Estimating The Tonality Of Polyphonic Audio Files: Cognitive Versus Machine Learning Modelling Strategies

In this paper we evaluate two methods for key estimation from polyphonic audio recordings. Our goal is to compare a strategy using a cognition-inspired model with several machine learning techniques, in order to find a model for tonality (mode and key note) determination of polyphonic music from audio files. Both approaches take as input a vector of values related to the intensity of each of th...


Alias-free, Multiresolution Sinusoidal Modeling for Polyphonic, Wideband Audio

In this paper, we describe an improved method of generating more accurate sinusoidal parameters {amplitude, frequency, phase} from a wideband polyphonic audio source in a multiresolution, nonaliased fashion. This significantly improves upon previous work of sinusoidal modeling that assumes a single-pitched monophonic source, such as speech or an individual musical instrument. In addition to a m...


Adaptive Harmonization and Pitch Correction of Polyphonic Audio Using Spectral Clustering

There are several well known harmonization and pitch correction techniques that can be applied to monophonic sound sources. They are based on automatic pitch detection and frequency shifting without time stretching. In many applications it is desired to apply such effects on the dominant melodic instrument of a polyphonic audio mixture. However, applying them directly to the mixture results in ...




Publication date: 2005
